How well can one jointly measure two incompatible observables on a given quantum state? 
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Heisenberg's uncertainty principle is one of the main tenets of quantum theory. Nevertheless, and despite its 
fundamental importance for our understanding of quantum foundations, there has been some confusion in its 
interpretation: although Heisenberg's first argument was that the measurement of one observable on a quantum 
state necessarily disturbs another incompatible observable, standard uncertainty relations typically bound the 
indeterminacy of the outcomes when either one or the other observable is measured. In this paper, we quantify 
precisely Heisenberg's intuition. Even if two incompatible observables cannot be measured together, one can 
still approximate their joint measurement, at the price of introducing some eiTors with respect to the ideal 
measurement of each of them. We present a new, tight relation characterizing the optimal trade-off between 
the error on one observable versus the error on the other. As a particular case, our approach allows us to 
characterize the disturbance of an observable induced by the approximate measurement of another one; we also 
derive a stronger error-disturbance relation for this scenario. 



The discovery and development of quantum theory have 
generated passionate debates amongst its founding fathers. 
The surprising features of the theory — e.g., its probabilistic 
nature, its uncertainty principle [1] or its nonlocality \2i, (M — 
were indeed too counter-intuitive to satisfy all physicists: Ein- 
stein, for instance, famously argued that "God does not play 
dice" iQ], and could not accept the apparent "spooky action at 
a distance" |50 that seemed to be allowed by the theory. Inter- 
estingly, it has since then been realized that what first seemed 
to be limitations of the theory — the impossibility to perfectly 
predict measurement outcomes and to explain them with local 
hidden variables — can turn out to allow for useful applications 
for information processing, such as quantum cryptography for 
instance 1Q| . With the advent of quantum information science, 
it becomes all the more essential to clarify what can or cannot 
be done quantum mechanically. 

The well-known uncertainty principle is typically expressed 
in terms of "uncertainty relations". To fix the notations, let us 
define the standard deviations AA, /S.B of two observables A 
and B in the state ji/;) as 



AA = {^\{A-{A)f\^) 
AB = (V;| (B - {B)f KO 



1/2 
1/2 



(1) 

(2) 



with {A) = (-0|A|V') and {B) = (t/'I-BIV'), and the "value" of 
the commutator [A, B] = AB — BA in the state \i/j), divided 
by 2i, as 



Cai 



2i 



,{nA,Bm. 



(3) 



Robertson's well-known uncertainty relation ^ then imposes 
that 



AAAB > \Cab\ 



(4) 



Such uncertainty relations are often wrongly interpreted — 
even, historically, by some of the most illustrious authors 181- 
[llll — as saying that one cannot jointly measure the observ- 
ables A and B on the state \tp) when Cab t^ 0, or that the 
measurement of one observable necessarily disturbs the other 



Although this last observation corresponds indeed to Heisen- 
berg's intuition jTI], this is actually not what standard uncer- 
tainty relations imply, let-alone quantify UM- Rather than re- 
ferring to joint (or successive) measurements of two observ- 
ables on one state, they indeed bound the statistical deviations 
of the measurement results of A and B, when each measure- 
ment is performed many times on several independent, identi- 
cally prepared quantum states. 

In this paper we aim instead at precisely quantifying 
Heisenberg's original formulation of the uncertainty princi- 
ple. Even if two observables A and B are incompatible and 
can indeed not be jointly measured on a state |f/;), it is still 
possible to approximate their joint measurement. How good 
can such an approximation be? What is the optimal trade-off 
between the error induced on the measurement of A and the 
error on B7 Between the error in the approximation of one ob- 
servable and the disturbance implied on the other? We answer 
these questions below by deriving new, tight error-trade-off 
and error-disturbance relations. 



APPROXIMATE JOINT MEASUREMENTS 

Let us start by setting up our general framework for ap- 
proximate joint measurements. Our presentation is inspired 
by those of Refs. lo - HSll . and is restricted here to the basics; 
more details are given in the Supporting Information (Part A). 

In order to approximate the measurement of an observable 
A on a quantum system in the state \ip) (in some Hilbert space 
H), a general strategy consists in measuring another, "approx- 
imate" observable A, possibly on an extended Hilbert space — 
i.e., on the joint system composed of the state \tp) e H, and 
of an ancillary system in the state |^) of another Hilbert space 
/C. In this picture, the impossible joint measurement of two 
incompatible observables A and B on jV') can thus be approx- 
imated by the perfect joint measurement of two compatible 
(i.e., commuting) observables^ and S on |-0,^) = \ip)<E)\£,) G 
T-L^JC. Note that in full generality, we do not assume a priori 
(for now at least) that A and B must have the same spectrums 
as A and B. 



Following Ozawa fTslHsl - USll . we characterize the quality 
of the approximations A and B of A and B, respectively, by 
defining the root-mean-square (mis) errors 



eA = (V,^|(^-A®1)2|V;,0 
eg = (V,ei(S-S®l)'lV',0 



1/2 
1/2 



(5) 
(6) 



These rms errors, which generalize standard definitions in 
classical estimation theory II19I1 . quantify the statistical devia- 
tions between the approximations A and B, and the ideal mea- 
surements of A and B. We refer to Refs. iH H mim for 
discussions on the motivations and appropriateness of such 
definitions. There has been a controversy 11221 12311 on the 
question whether these quantities were experimentally acces- 
sible; two different indirect methods have nevertheless been 
proposed lUalZillj and recently implemented ll24ll25ll . 



ERROR-TRADE-OFF RELATIONS 
FOR JOINT MEASUREMENTS 

The fact that quantum theory forbids perfect joint measure- 
ments of incompatible observables implies that the rms er- 
rors {ea, £b) can in general not take arbitrary values. Some 
limitations on their possible values have been obtained previ- 
ously III3L USl 1 1 6. .26.-.28.1 . which we review below. For histor- 
ical reasons, such limitations are often referred to as "uncer- 
tainty relations" (for joint measurements). We will keep this 
terminology when we refer to previously derived relations; 
however, since such relations are not strictly speaking about 
uncertainty but about errors in the approximation of joint mea- 
surements, we will prefer the terminology "error-trade-off re- 
lations (for joint measurements)". 



The Heisenberg-Arthurs-Kelly relation 



derived by Arthurs and Kelly 112611 (for position and momen- 
tum measurements — it was generalized to arbitrary observ- 
ables by Arthurs and Goodman luTIl ). we will call it below the 
Heisenberg-Arthurs-Kelly relation. 

This relation was indeed proven to hold, under some restric- 
tive assumptions on the approximate joint measurements iflSl 
[la , I26I - I28I1 ; namely, it holds when A and B are such that the 
mean errors {ij,(\A~ A®i\i},^) and {ij,^\B ~ B®t\i),() 
are independent of the state \ip). As we are only interested 
here in one particular state |V'), for which we may want to 
adapt our approximation strategy, such an assumption is quite 
unsatisfactory for our purposes: we indeed aim at character- 
izing the trade-off between e^ and e^ for all possible approx- 
imate measurements, in which case the Heisenberg-Arthurs- 
Kelly relation ^ does not generally hold jll 



Ozawa's "uncertainty relation" 

Only recently did Ozawa show lITsil how one could derive 
a universally valid "uncertainty relation" for joint measure- 
ments, by adding two additional terms to the left-hand-side of 
Eq. (|7]i. His relation writes 



eAee + ABeA + AAee > \Cai 



(8) 



(We note also that a very similar but inequivalent relation was 
derived by Hall [14], which involves the standard deviations 
A^ and AB rather than AA and AB; see Supporting Infor- 
mation, Part D for a discussion.) 

The three terms in Ozawa's relation come from three in- 
dependent uses of Robertson's relation dU to different pairs 
of observables. While this indeed leads to a valid relation 
and allows one to exclude a large set of impossible values 
i^Ai £b)i this is not optimal, as the three Robertson's rela- 
tions (and therefore Ozawa's relation) in general cannot be 
saturated simultaneously. 



In his seminal paper JTlJ, Heisenberg argued that the mea- 
surement of the position g of a particle necessary implies a 
disturbance rjp on its momentum p, and that this disturbance 
is all the more important as the precision of the measurement 
of q is large (or as the "error" eq is small) — so that e, ?/p ^ h, 
where h is the Planck constant. 

The formalization of Heisenberg 's intuition rapidly lead to 
the derivation of general uncertainty relations in terms of stan- 
dard deviations [as in dUi] rather than of error and disturbance. 
Nevertheless, it is commonly believed that a similar relation 
to Robertson's should also restrict the possible values of the 
errors eA and eg on A and B in an approximate joint mea- 
surement, in such a way that 



<^A <^B > \CAi 



(7) 



Although it is debatable whether this is really how the claims 
in Im should be interpreted and generalized, this relation is 



A new, tiglit error-trade-off relation for joint measurements 

Using a general geometric inequality for vectors in a Eu- 
clidean space (Lemma [T] of the Methods below), one can im- 
prove upon the sub-optimality of Ozawa's proof, and derive 
the following error-trade-off relation for approximate joint 
measurements: 



AB 



2 ,2 



A^ 



2 ,2 



2^ AA^AB^ 



Cl u eA<^B > Cj 



-AB 



AB : 

(9) 



or, in its dimensionless version — when AA, AB ^ 0, with 

^A = i^, ee 



- — ^" - xfc and Cab = ^^ 



AAAB- 



;2 



;;2 



^a/I ^^ab ^-A^b > Cab- 



(10) 



The proof is detailed in the Methods section. It can easily be 
checked (see Supporting Information, Part D) that Ozawa's 
relation (O can directly be derived from our new relation (0. 
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FIG. 1 : Error-trade-off and error-disturbance relations. The fig- 
ure illustrates (in the case Cab = 1) how the different eiTor-trade- 
off and eiTor-disturbance relations (|7], (|8], (I9^jl0l and (II 2t restrict 
the possible values of the normalized rms errors e^i, eg. Contrary 
to the Heisenberg-Arthurs-Kelly relation (|7), Ozawa's relation ^ is 
always valid; however, it does not fully characterize the whole set of 
forbidden values for (Ia, eb) (dark shaded area), which is precisely 
delimited by our new relation (l9tilOt. Imposing the same-spectrum 
assumption can imply strictly stronger constraints, such as Eq. J12t 
for the case where A'' = B^ = 1 and (A) = (B) — 0: more values 
of (e^, eg) = (e^, 773) are forbidden (light shaded area). 
The theoretical values expected from the experiment of Ref. 12411 are 
also shown; they do not saturate ( I12t . except for e^ = or eg = 0. 
On the other hand, in an ideal implementation the experiment of 
Ref. 12511 would saturate our inequality l ll2t . 



Interestingly, one observes in particular that Ozawa's relation 
remains valid even if one drops the term e_4 eg — ^precisely 
the term that appears in the Heisenberg-Arthurs-Kelly rela- 
tion (|7]i. 

Not only is our relation stronger than Ozawa's, it is actually 
tight, for any A, B and j?/'), any values (e^, eg) saturating in- 
equality (I9UTOI1 can be obtained. This can even be achieved 
by projective measurements on |?/'), without introducing any 
ancillary system: see Supporting Information (Part C) for ex- 
plicit examples. Hence, contrary to previously derived rela- 
tions, our new one does not only tell what cannot be done 
quantum mechanically, but also what can be done. 

Figure [T] illustrates the constraints imposed by the three 
error-trade-off relations (0, dS) and (0-[TO), in the plane 
(e_4, ee). Our new relation (fTOt thus characterizes precisely 
the optimal trade-off between e^ and 1b in the general context 
of approximate measurements. The values below the thick red 
curve cannot be reached, while all values on and above the 
curve can be obtained, by tuning the actual measurements A 
and B depending on how well one wants to measure one ob- 
servable, at the expense of increasing the error on the other. 



The error-disturbance scenario 
and the same-spectrum assumption 



Let us now consider a special case of our general frame- 
work for approximate joint measurements: that of the error- 
disturbance scenario, as first discussed by Heisenberg |jl]|. 

In this context, one considers the disturbance r\s in the 
statistics of one observable, B, due to the unsharp measure- 
ment of another observable, A. The latter is typically ap- 
proximated by the measurement Ma of a probe (or ancil- 
lary system, in the state |^)), which interacts with the state 
IV') via a unitary transformation U lUSll . In such a case, 
the approximation of A corresponds to the measurement of 
.4 = t/^(]l ® Ma)U on \ip,£,), while the perturbed measure- 
ment of B after the interaction with the probe corresponds to 
the measurement of B = W{B <^ t)U (note that A and B 
commute). This error-disturbance scenario can be cast into 
the same formalism as our joint measurement framework; the 
rms error e^ is now interpreted as the rms disturbance ?/g of 
B, with formally the same definition lUSll : rjB = eg as defined 
in®. 

Any error-trade-off relation derived in the more general 
framework of joint measurements thus remains valid in this 
error-disturbance scenario. In particular, when interpreting eg 
as the rms disturbance 775, Ozawa's relation (|8]l writes: 



e^ ?/g + AS e^ + A^ ?7g > \Cab\ 



(11) 



This error-disturbance relation was actually introduced by 
Ozawa before its previous version (|8) for joint measure- 
ments II13I1 . In a similar manner, our new error-trade-off re- 
lation (l9mob also implies a new error-disturbance relation, by 
simply replacing eg by ?7g. 

The difference with the previous, more general scenario of 
joint measurements is however not merely in the interpretation 
of eg. A crucial point is that B ~ W{B (E) i-)U now has the 
same spectrum as B; it is furthermore typically (but often im- 
plicitly) assumed in the error-disturbance scenario that Ma, 
and hence A = UH^ <8i Ma)U, also has the same spectrum 
as A III3II21I I24II25II . Because of these constraints, one may 
expect stronger restrictions on the possible values of (e^, r^g) 
to hold, and that stronger "error-disturbance relations" can be 
derived. (For simplicity, and by abuse of language, we call 
error-disturbance relation any error-trade-off relation derived 
under the same-spectrum assumption, as this is the crucial dif- 
ference between the two scenarios.) 

To illustrate this, let us now restrict our study to the case of 
dichotomic observables A, B with eigenvalues ±1 (such that 
A^ = B'^ = I), and to states \ip) for which {A) = (B) = 
(which implies /S.A = AB = IXas considered for instance 
in the experiments of Refs. ll24ll25ll . We show in the Meth- 
ods section that in this particular case, and with the same- 
spectrum assumption (hence, A^ = B^ = i as well), an anal- 
ogous relation to (l9mo]i holds, where e_4 and eg are replaced 



by e^\/l - -f andi]B\^l 



^, respectively: 



(-¥)-^^(-^) 



2\n-ci 



1b ^^^f^ ^^\[^^ ^ Cab- (12) 

This new error-disturbance relation is strictly stronger 
than (IgmUt (and than Ozawa's relation (fTTTi). Furthermore, 
we show in the Supporting Information (Part C) that it is tight 
when I (V^ I ^-B I "0)1 = 1: for any A,B and |V') satisfying the 
constraints above, one can reach any values (e^, 7^^) that satu- 
rate the inequality, using approximate measurements such that 
^2 — g2 _ I jjjg constraint that inequality ( fTSl i imposes on 
the possible values of (e^, r^g) is also illustrated on Figure [T] 
note that contrary to our error-trade-off relation (Pl-ITOl. in- 
equality (fTZt also bounds the possible values of (e^, 775) from 
above (see also the inset of Figure S 1 in the Supporting Infor- 
mation). 

Let us finally mention that if one imposes the same- 
spectrum assumption on B only (e.g., if one does not impose 
that Ma in the specific error-disturbance scenario considered 
above has the same spectrum as A), one can also derive a sim- 
ilar, tight error-disturbance relation (under the assumptions 
now that _B^ = 1 and {B) = 0), where only eg in dgHTOli 



is replaced by 77^ v 1 — ^; see Eq. (S20) in the Supporting 
Information (Part B). 



Interestingly, e^ and eg are independent of the polar angle 
9 of rh. Note in particular that one can thus have for in- 
stance Cyi = even when A is quite different from A; also, 
when 111 comes close to the north or south poles of the Bloch 
sphere, one can have arbitrarily close projection directions 111 
leading to quite different values for e^ and eg — in our case, 
(e^, eg) = (0, sin 0) for ip ^ (j)a and (e^, eg) == (sin 0, 0) for 
(f = (pb- These somewhat unexpected properties might how- 
ever only be artefacts of the particular definitions of errors we 
use; it would be interesting to investigate possible alternative 
definitions that do not exhibit such behaviours. 

Let us now impose that A and B have the same spectrum 
as A and B; i.e., since A and B are here ±l-valued observ- 
ables, A^ = A'^ = B"^ = B^ = i. Assuming furthermore 
that Oa = Ob = f , we have (A) = {B) = 0. Inequality O 
then applies; it can be saturated in the error-disturbance sce- 
nario [with A = W{1 ® Ma)U and B = W{B ® l)U] 
in the following way: let cr^ ~ cosip ax + sim^ cry, for 



</5 € [0a, (/>()] (i.e.. 



= rh • (7 with 9 = ^), and let \'nv^) 



be its (normalized) eigenvectors, corresponding to its eigen- 
values ±1; we then define Ma ~ (Jip, U = (C/r® ]l)-t/copy 



with C/copy a unitary such that Ucopy\m.^,0 

Ucopyl'ITT'ip 



= \m^,rn^) (e.g., with |0 = \m+ 

unitary II29I1 in the {|77iJ)} basis), and Ur — e~*~T~ 
then gets 



\m+,m+) and 



m^), a CNOT 



e^ 



2 sin 



f- 



jyg = 2 sin 



f 



'^^: one 



(15) 



Example: qubits 

As an illustration of our error-trade-off and error- 
disturbance relations (|91JT0]| and (fT2t . let us consider the sim- 
plest case of qubits. We choose |t/') to define the north pole 
of the Bloch sphere, and let ^ = a • <? and B ^ h ■ a 
(where a = (ctx, ctyjUz) denotes a vector composed of the 
3 Pauli matrices) be two ±l-valued qubit observables charac- 
terized by unit vectors a and b on the Bloch sphere, of po- 
lar and azimuthal angles 9a, 4>a and 9b, (f>b, respectively. We 
take 9a, 9b £ [0, tt] and we assume, for convenience, that 

Cj) = cPb - (t>a e [0, f ]. 

For such a choice of \tl'), A and B, one finds AA = sin 9a, 
AB = sin 9b, and Cab = sin 9a sin 9b sin (p. Equation (flOl l 
then writes, for sin 9a sin 9b 7^ 0: 



e'A 



2 cos e^ eg > sin 



(13) 



One can check that this error-trade-off relation can simply 
be saturated by defining A and B to be projective measure- 
ments in the same eigenbasis, specified by any unit vec- 
tor rii on the Bloch sphere with polar and azimuthal angles 
9 G]0,7r[ and ip E [(f)a,4>b]- More specifically, for A = 
[(cos 6'a— cosflrii • a)l + (rii • a— cos 0a cos0)m • (T]/sin^0 
and B ~ [(cos0(,— cos m • b)l + (m • b— cos^bcosfl)!!! • 
(?]/ sin^6', one obtains 



^A 



m\{(p~4>a), eg 



1(06-.^). 



(14) 



On the recent experimental tests ofRefs. 1241 and jZ 

Two experiments were recently reported, showing a vi- 
olation of the Heisenberg-Arthurs-Kelly relation (|7) (more 
specifically, of its error-disturbance version, where eg is re- 
placed by r/g) and a verification of Ozawa's error-disturbance 
relation (fTTT i in qubit systems. 

The first experiment t24ll measured neutron spins, using the 
indirect method proposed in lUSll to estimate the rms errors 
and rms disturbances e^, 77g. A = CTx was estimated from the 
measurement of a^ = cos (p (Tx + sin ip ay on \^) ~ | + z) 
(the eigenstate of ctz, corresponding to its eigenvalue +1), 
and was followed by the measurement of i? = a^; note that 
A^ = B^ = t, (A) = (B) = 0, and Cab = 1. The expected 
theoretical values for the rms errors and rms disturbances w ere 
eyi = 2 sin ^ and ijis = \/2 cos ip. These are plotted on Fig- 
ure [T] one can see that they are not optimal as they do not 
saturate our tight error-disturbance relation (ITZl . From the 
analysis above, it appears that adding a rotation C/r before the 
measurement of B would however be enough to allow the ex- 
perimental setup used in [241 to saturate inequality (fT2t . 

The second experiment |25] measured the polarization of 
single photons, using weak measurements as proposed in 1I21I1 
to estimate the rms errors and rms disturbances. A was ap- 
proximated from a measurement of variable strength based on 
a CNOT unitary. Because the weak measurements used to es- 
timate EA and ?/g are not infinitely weak, they slightly perturb 



the state of the photon, adding some noise. However, in an 
ideal implementation the experiment of lIZSIl would saturate 
the bound of our new error-disturbance relation (ITZI . 

To finish with, let us however emphasize that no experi- 
ment will ever demonstrate the universal validity of an "un- 
certainty relation" (or error-trade-off, or error-disturbance re- 
lations), despite what the title of Ref. ll24r suggests. First note 
that in order for such experiments to be conclusive, one needs 
to perfectly trust the implementation; otherwise, systematic 
errors in the preparation of \i/j) or in the estimation procedure 
for e^ and eg could radically change the values of the dif- 
ferent terms in the relation, leading to unjustified conclusions 
(and possibly even "showing" a violation of a valid relation!). 
All one can do is then to check that in that particular (perfectly 
trusted) implementation, for some particular A, B and |i/;) and 
for the particular approximations A and B implemented in that 
experiment, the error- trade-off or error-disturbance rel ation of 
interest is satisfied. There is indeed no way experimentally 
to test all possible approximate joint measurement strategies, 
and the part icular choice of A and B could be non-optimal (as 
e.g. in i24ll ). It is of course trivial to obtain data satisfying 
an error-trade-off relation, if one does not try to optimize the 
values of [cai^b)'- if the relation is universally valid, then 
any measurement strategy (e.g. outputting random results) 
will satisfy it! One can even similarly trivially violate the 
Heisenberg-Arthurs-Kelly relation (|7]i, e.g. by actually mea- 
suring A perfectly (so that e^ = 0), and outputting any values 
to approximate B (as long as eg < oo). What is less trivial 
and therefore more interesting is to show experimentally that 
a tight error-trade-off or error-disturbance relation can indeed 
be saturated. 



DISCUSSION 

We have presented a new, state-dependent error-trade-off 
relation [Eqs. (l9]-[T0l. in the general framework of approxi- 
mate joint measurements. Our relation is universally valid, 
whether the Hilbert spaces of interest are of finite — as in our 
qubit example — or infinite dimensions (provided \ip,£,) is in 
the domains of A^^\ B'^'^\ A^^\ B^^\ and of all their products 
that are involved in the proof of (|91JT0]|) — e.g. for the mea- 
surement of position and momentum, as first considered by 
Heisenberg. Note also that although the framework for joint 
measurements was presented for pure states, it can easily be 
generalized to mixed states, and Eqs. (l9]-[T0li still hold. Impor- 
tantly, our new error-trade-off relation was shown to be tight, 
and therefore to fully characterize the whole set of possible 
values of rms errors (e^,eg) (in the case of pure states; our 
relation may in general not be tight for mixed states). This an- 
swers the question posed in the title, for pure states and when 
the quality of the measurement is quantified by these rms er- 
rors. 

Error trade-off relations imply error-disturbance relations 
as a particular case. However, because of the same-spectrum 
assumption, strictly stronger relations can in general be de- 
rived in the error-disturbance scenario; we presented an ex- 
ample of such an error-disturbance relation, for ±1 -valued 



observables with (A) — (B) — 0, allowing us to highlight a 
quantitative difference between the two scenarios. The deriva- 
tion of a more general relation under the same-spectrum as- 
sumption is left for future work. 

Our relations apply to the projective measurement of two 
observables A and B. It would be interesting to see if these 
could be generalized to some POVMs (see lll4ll2C)ll . however, 
for the difficulties encountered), or to more observables 113011 . 
In the error-disturbance scenario, it may also be desirable to 
quantify the disturbance of the quantum state directly, rather 
than of the statistics of another observable; this is left as an 
open problem. 

As highlighted above, our relations bound the rms errors of 
A and B, as defined in Eqs. (|5]-l6ll. In the context of quan- 
tum information, one may however prefer to use information- 
theoretic definitions for the quality of approximations. De- 
veloping such definitions, and deriving corresponding univer- 
sally valid and tight error-trade-off or error-disturbance rela- 
tions would certainly be an interesting direction of research. 
This may indeed give a clearer operational meaning to such 
relations, and would be more adapted to their use in possi- 
ble applications (in the same way e.g. as entropic uncertainty 
relations are useful to prove the security of quantum crypto- 
graphic protocols 1I31II32II ). This will involve radically differ- 
ent proof techniques, which may also allow one to consider 
error trade-offs in general probabilistic theories, not re stricted 
to quantum theory and to its Hilbert space formalism. This 
will undoubtedly give more insight on the still puzzling, multi- 
faceted uncertainty principle. 



METHODS 

In order to prove below our error-trade-off and error- 
disturbance relations (l9]-[T0li and (fT2l i. we start by introducing 
two general inequalities for real vectors. 



Lemmas: Geometric Inequalities 

Let d, b be two unit vectors of a Euclidean space £, and let 
us define x = d • 6. We prove in the Supporting Information 
(Part B) the following lemmas: 

Lemma 1. For any two orthogonal vectors x andyofS, one 
has 



\\a-xr + \\b-yr + 2^1^\\a-x\\\\b-y\\ > x' ■ 

(16) 

Lemma 2. For any two orthogonal unit vectors x and y of£, 



defining a± — ^\ — (a ■ x)'^ and b± — -J 1 — {b ■ y)"^, one 
has 



a\ + b\ + 2^l~X^a^bi_ > 



X 



(17) 



\\b-y\\ = h 



a-x\\ = Ea 




\\b-y\\ = eg 



a-x\\ = Ea 




FIG. 2: Geometric construction used in the proof of our gen- 
eral error-trade-off relation f9Hi0t. The real vectors a, b, x 
and y satisfy the assumptions of Lemma [T] the particular choice 
of vectors illustrated here (for which x = siiKf} — sm{(t>b~(t>a), 
||a — a;|] = sin((p — (j)a) and ||fe — yH = sm{(j}b — <^)) saturates 
inequality ( 116b . which quantifies the optimal trade-off between the 
distance from the unit vector a to an axis along a direction x, and 
from the unit vector b to an axis along a direction y, orthogonal to x. 



FIG. 3: Geometric construction used In the proof of our error- 
disturbance relation flU (for the case where A^ — B^ = A^ = 
B'^ — t and (A) — (B) = 0). The real vectors a, b, x and y 
satisfy the assumptions of Lemma|2l the particular choice of vectors 
illustrated here (for which x = sinijf) = sm{4)b—4>a), \\a — x\\ = 



2 sin 



■/J-^g 



and 116 



y\ 



— 2 sin 



•i>b-v 



) saturates inequality ( I17t . 



which quantifies the optimal trade-off between the distance from the 
unit vector a to another unit vector x, and from the unit vector b to 
another unit vector y, orthogonal to x. 



Proof of our error-trade-off relation f9Hi0t 



Let us now define, in the non-trivial case AA AS > 0, the 
ket vectors 

la) = —, H',0, \b) = T^ |V',0> (18) 



AA '^'^" ' ' AB 



(19) 



By writing these vectors in any orthonormal basis of % ® K- 
(e.g., the common eigenbasis of A and E), and denoting by 
Re and Ini their real and imaginary parts, respectively, one 
can define the following real vectors: 



X = 



Re I a) 
Im|a) 

Re I a;) 
Imla;) 



b = 



, y = 



lm\b) 
-Re|6) 

Im|y) 
-Re|y) 



(20) 
(21) 



a 



One then has 

2 = (Re I a) )"^- (Re I a)) + (Im|a))"^-(Im|a)) = {a\a) = 1, 



\br = 1, 



(22) 
(23) 



((x|-(a|)(|x) 
'A-A(^ 



(^,^l(- 



-|a)) 

1\2 



\\y-b\\' 



AB^ 



AA 



^B^ 



IV', = 



A^2 



el, (24) 



(25) 



a ■ b = 



x-y 



1 mA,Bm 

2i AAAB AAAB 

1 {i;,S,\[AM^,i) 



(Ini|a))"^- (Re|6)) =Im(a|6) 
Cab 



— Cab, 



2i 



AAAB 



0. 



(26) 



(27) 



Hence, the (normalized) rms errors e^, eg can be interpreted 
as distances between vectors iIitIuOIi . while the commutativ- 
ity of A and B translates into an orthogonality condition for x 
and y. 

The vectors a, b, x and y thus satisfy the assumptions of 
Lemma[T] that ||a|| = ||fe|| = 1 and x ■ y = (see Figure|2]i. 
Together with Eqs. (|241426] |. inequality ( fTSI l implies our gen- 
eral error- trade-off relation for joint measurements (ITOl . After 
multiplication by AA^AB^, we obtain Eq. (|9]l (for which the 
case AA AB = is trivial, as it implies Cab = 0). 



Proof of our error-disturbance relation ([12), for the case where 



B' = A' 



B' 



1 and {A) = (B) = 



With the assumptions that A^ ^ B'^ = 1 and (A) = {B) = 
(hence AA = AB = 1), and that A and B have the same 
spectrum as A and B (hence A^ = B^ = i), the real vectors 
a, b, X {— x) and y (= y) defined as in (12014211 1 are now such 



that (with ets = tjb in the error-disturbance scenario) 



\b\\ 



(28) 



The vectors a, b, x and y thus satisfy the assumptions of 
Lemma |2] (see Figure |3]l. Together Eqs. ( |29H3TT i. inequal- 
ity (T7\ gives our error-disturbance relation (ITZI . 



1- (a-xf = \\x-af(l 



a ■ b = Cai 



bi^l-ib-yr=rf^[l-'Jf 



X ■ y = 0. 



^\n 



4 
(29) 

(30) 
(31) 
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SUPPORTING INFORMATION 



This Supporting Information starts by giving more details on our general framework for approximate joint 
measurements (PartlAt. We then prove the two geometric Lemmas used in the proofs of our error-trade-off 
and error-disturbance relations (Part |B], and prove the tightness of our relations (Part|C). We finally show 
explicitly how Ozawa's "uncertainty relation" for joint measurements follows from our new error-trade-off 
relation (PartlDt. 



A. MORE ON APPROXIMATE JOINT MEASUREMENTS 

In the main text we have presented our general framework 
for the approximate joint measurement of A and B on \ip) 
by introducing two commuting observables A and B to be 
measured on the state \4',0- We give below an alternative 
description in terms of Positive Operator- Valued Measures 
(POVMs) [Sly, which justifies the generality of our frame- 
work; we then discuss how to optimize the quality of the ap- 
proximations, for a given POVM or a given projection eigen- 
basis. 



1. General strategy for approximate joint measurements 

In quantum theory, the most general strategy to get some 
information on a quantum system in the state \ip) € "H is 
to perform a POVM; we denote such a general' POVM by 
M = {Mm}, where its elements M,„ are Hermitian, positive 
operators acting on H, with X^m^"' = 1- In order to ap- 
proximate the measurements of two observables A and B on 
\ijj), one can then use the results of the POVM M in the fol- 
lowing way; for each of its possible outcomes m, we define 
some real value /(m) that aims at estimating the result of an 
ideal measurement of A, and some real value g{m) that aims 
at estimating the result of an ideal measurement of B (where 
without loss of generality, f{m) and g{m) can be assumed to 
be deterministic functions). 

Now, from Neumark's extension theorem |S2l, any POVM 
can equivalently be represented as a projective measurement 
on an extended Hilbert space. For convenience, we can thus 
assume that M corresponds to the measurement of a Hermi- 
tian observable Ai = X]m™l'^)('^l ^'^ '■^^ ^'■^'■^ I'^/'iO = 
IV") ^ 10 > where |0 £ /C is the state of an ancillary system. 
In this picture, the projection of \ip,^) onto a (normalized) 
eigenvector \m) £ Ti IC gives the eigenvalue m of A^, cor- 
responding to the outcome m of the POVM M. The approxi- 
mation strategy for A and B described above, using the output 



values /(m) to approximate the measurement of A and g{m) 
for that of B, then actually corresponds to the (joint) measure- 
ment of the observables ^ ~ J2m /("^) |'7i)(?7i| = fi-M) and 
B = X]m5('^) l'7^)(n^| = g{-M) on \ip,£,). These two com- 
muting observables correspond precisely to the observables A 
and B considered in the main text. 



2. Quality of the approximations 

To characterize the quality of the approximations of A and 
B, we have introduced in the main text the rms errors such 
that 



2 


= (V',ei(^-^®i)'lV',o, 


(SI) 


4 = 


= (^,ei(6-i?®ii)2|v,0- 


(S2) 



It is worth noting that e^ and eg do not depend on the partic- 
ular, non-unique Neumark extension (specified by the Hilbert 
space /C, 10 G ^ and the eigenvectors \m) E T-L^JC) chosen 
for the POVM M. Using the definitions A^Y. /(m)|m)(m| 
and B = ^ g{rn)\'m){'m\, and the fact that for all m, M^ = 
1 (g) (0 ■ \m){m\ . 11 (8) 10' the rms errors can indeed be directly 
expressed in terms of the POVM elements M,„ and of the 
functions f{m) and g{m) as follows IIS3l[S4ll : 



^A 
,2 



Y,{^\{A-f{m))Mm{A~f{m))\^), (S3) 

m 

Y,{i^\{B-g{m))M^{B-g{m))\^). (S4) 



3. Optimal clioice for f{m) and g(rn) 



' Note that we do note make any restrictive assumption on the possible 
POVM outcomes: there can be an arbitrary number of different real val- 
ues m, and these could even be continuous, in which case sums would 
need to be appropriately replaced by integrals. 



For a given POVM M or a given projective measurement 
of I")/', onto an eigenbasis {|m)}, one can choose the output 
functions f{m) and g{m) so as to optimize the quality of our 
approximations (i.e., minimize e^ and e^)- 

Developing Eq. ( ISlb with A = Ylim /("^) \m){m\, one in- 



9 



deed finds^ flSJ 



^\ = (^'> - E ^(™) Re 



m|p(rn)>0 



(m|y4(8)]l It/',^ 
(m|7/;,^) 



?n|p(rn)>0 



^ p(m) I /(m) — Re 



{m\A®t 1^,0 
(m|i/',0 



(S5) 



with p(Tn) = |(7n|-0,^)p (and where the sums are over the 
normalized eigenvectors \m) of A^ for which p{ra) > 0). 
The value f{m) only contributes to the last sum in ( IS5b . It 
thus appears clearly that in order to minimize the rms errors 
e^ and e/j, the optimal values for f{m) and similarly for g{m) 
are, for each possible outcome m. 



fopt{m) = Re 
gopt{m) = Re 



(m|V',0 



(S6) 
(S7) 



Interestingly, one may recognize above that the optimal val- 
ues fopt{fn) and goptijn) are given by the real parts of the 
so-called weak values 115511 of the observables A®t and B®t, 
respectively, pre-selected in the state IV', ^) and post-selected 
in the state \m). This thus provides an interesting interpre- 
tation to the real part of weak values as the optimal approxi- 
mation of an observable, when the criterion for optimality is 
taken to be the rms error llS3ll . 



For this optimal choice of output functions / = fopt and 
.9 = 9opt, one then gets, from (IS5b and from the fact that 

[a^) = ^,^^ (?/>,^|A(g)ll|m)(TO|A(g)]l|V;,^), where the sum 
now runs over all normalized eigenstates \m) of M.: 



f-AUopt) = 2^ p{m) (Im 



m\p{m)>0 

+ Y. l("i|^®l|^,OI' (S8) 



|2 



7n|p(rn)— 



i{9opt) 



rn\p(m)>0 



p(rn) Im 



{m\B®l\i^,Cl 



(m|i/',0 



- Y. I(™|S®1|V',0I 

??i|p(rn)— 



(S9) 



which now involve the imaginary parts of the weak val- 
ues iSql . together with some terms related to eigenvalues ra 
for which p{m) = 0. 

Note again that fopt{m) and gopti'm) can be expressed in 
terms of the POVM elements M„i directly. Namely; 

(M™A) (M„,B) 

fopt{m) = Re ^^^ ^ , gopt{m) = Re ,,^ , , (SIO) 



(M„) 



(M, 



where (MmA) / (Mrn) and (M„ii?)/(M„i) are the standard 
generalisations of weak values for POVMs [STJ). However, 
in general one has (with now p{m) ~ (M„i)): 

^\{Iopt) = {A^) - E P("^)[Rc(M„A)/(M™)]2 

?TT,|p{m)>0 
m|p(m)>0 ^ ^ '"' ^ m|p(m)=0 

(A/„,)(^A/„.A)-|(M,„A)|2 



+ E 



ra\p{m)>0 



p{m) 



^ E^M(i-wx)'+ E(^^-^™^)' 



i|p(m)>0 



(M„ 



7n|p(7n)— 



(sii) 



4ifopt) = (B^) - E P{m)[Re{MrnB)/{Mr. 

m|p(77l)>0 



> > 79(772) Im 



m\p{7n)>0 



(M„) 



f ^ {BMmB), 

rn|p(m)— 

(SI 2) 



where the inequalities follow from the Cauchy-Schwarz in- 
equality applied e.g. to Mm \ijj) and Mm A\ip). 

Finally, one can easily check that the optimal values 
lopti'm) and gopti'm) derived above are such that the ap- 
proximations of A and B are "unbiased", in the sense that 
(A) = (A) and (B) = (B) . Let us emphasize however that 
fopt (m,) and gopt {iTi) may not be in the spectrum of A and B. 
While for general joint measurements we indeed do not re- 
quire the approximation functions to output eigenvalues of A 
and B, we nevertheless impose such a constraint in the error- 
disturbance scenario (see main text). In that case, it may not 
be possible for f{m) and g{m) to take the values fopt{m,) and 
gopt{m) given by Eqs. ( IS6US7b or dSlOb : see Part lUI] below 
for an example. 



B. PROOFS OF OUR GEOMETRIC LEMMAS 

We now prove the geometric inequalities of the Methods 
section, which hold for any two unit vectors a, b of a Euclidean 
space £ (with a ■ b ~ x)- Let us start with Lemma|2] 

Lemma 2. For any two orthogonal unit vectors x and y of£, 

defining a± = -^/l — (a • x)^ and b± = -J 1 — {b ■ y)-^, one 
has 



+ bl + 2^1 -X^a^bi_ > x' 



(S13) 



^ In the following, the notation (...) is used for {i/i|...|i/i). The expression 
inside the brackets does not need to be a Hermitian observable; if it is, the 
notation thus denotes its mean value in the state \ip). 



Proof of Lemma^ We use for convenience the notation Qx 



a ■ X. Let us define a\ ' = ^^ 



otherwise, and a]_ 



V^ 



if ixl ^ 1, af = 



if a^ 7^ 0, a 



(x) 



otherwise. 
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so that b — xo. 



X^aj and a; 



,W 



V 1 " A "-i 

aj_'|| = lorO, llaj'^^ll = 1 or 0, and a • a]"^ = a-a|"^ = 
Since x and y are orthogonal unit vectors, we have (b-x)^ 
ib-yf < ||6|p = 1, and hence 



axO. + a±a "i' , with 

(x) 



> 






Xax 



Vl-X^fl-L a^ 



(6) .{x) 



> Ixa.l-v/l-x'ax- (S14) 

It follows that 

[b^ + Vl-X^aj_y > {xa^f = x' (1 - 4), (S15) 



which is equivalent to (IS13b . 

Note that this inequality can only be saturated if d, fe, x and 
y are coplanar: indeed, the first inequality in (IS 14) would need 
to be an equality (which requires b e Span{.T, y}), and a sim- 
ilar constraint would apply to d. D 

We can now use the result of Lemma|2]to prove Lemma[Tl 

Lemma 1. For any two orthogonal vectors x and y of £, one 

has 



\\a~xf + \\b-y\\^ + 2^1^\\a-x\\\\b-v\\ > x' ■ 

(S16) 

Proof of Lemma\J} Let us define x = t^ (if ||£|| 7^ 0; other- 
wise, X is defined as any unit vector orthogonal to y),y = -Jirr 
(if II y II y^ 0; otherwise, y is defined as any unit vector orthog- 
onal to x), and let us use the notations of Lemma|2| a^ — a-x, 



a± = y^l — al, and b± = y 1 — (6 • y)^. One has 

||d — a^ll = II (d — flj;.-!:) + (flj^i: — x) 

II II 2 II II 2 

= d — axXU + uaxX — x\\ 

\ II - * l|2 1 2 2 

> a — arX\\ = i — a,_ = a 



(S17) 



and similarly, ||& — y|P > &1- Therefore, 

||d - f IP + ||6 - y||2 + 2v/l-x2 ||a - S\\ \\b- y\\ 

> al + bl + 2v/l - X^a^b^ > x', (S18) 

where the last inequality is due to Lemma|2] 

Note here that inequality ( IS16b can only be saturated if 
d, 6, X and y are coplanar and if x is the orthogonal projec- 
tion of a onto the direction x (so that the inequality in dSlTb 
is saturated), and yis the orthogonal projection of b onto the 
direction y. D 

For completeness, let us also introduce the following addi- 
tional Lemma: 




.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 



FIG. S 1 : Constraints imposed by our error-trade-off and error- 
disturbance relations f9tii0). (T2\ and <S20> (shown here for 
Cab ~ sin-j|-). Main figure: the shaded areas below the curves 
are inaccessible to the values of {tA, ee), depending on the assump- 
tions under which the relations are derived (see text). Inset: in the 
case where A^ = B'^ = A'^ ^ B^ ^ I and (A) = {B) = 0, 
inequality ( I12t also bounds the values of (e^, ee) = (e^, r^g) from 
above; similarly, inequality ( |S20t also upper-bounds the values of 
ee = VB- 



Lemma 3. For any two orthogonal vectors x and y of£, such 
that \\y\\ — 1, one has (with b± = J 1 — (b ■ y)'^): 



-2v/r 



X 



>x'. 



(S19) 



The proof of this Lemma follows closely that of Lemma [T] 
above. Inequality ( IS19I ) can only be saturated if d, b, x and y 
are coplanar and if x is the orthogonal projection of d onto the 
direction x. 

Lemmas [T] and 12] were used to prove our error-trade-off and 
error-disturbance relations (l9l-ITffl and (ITZi . respectively (see 
the Methods section). Lemma[3]can similarly be used to prove 
an error-disturbance relation when one does not assume that 
A must have the same spectrum as A. Namely, with the ad- 
ditional assumptions that B^ = B^ = i and (B) = 0, one 
has 



vl 



4 



-'AB ^A Vt3 



1-^ > 

4 - 



^2 
(S20) 



Figure |ST| shows a comparison between the constraints im- 
posed by each of our error-trade-off and error-disturbance re- 
lations (EHIOl), dnii and iSM . 

Let us finally mention that the necessary condition for the 
inequalities ( IS13b , ( IS16I ) and ( |S19t in the three lemmas to 



11 



be saturated may be helpful to inspire the choice of approx- 
imate measurements A and B if one wants to saturate our re- 
lations dMTOli. (O and ( IS20b . A and B should indeed be 
chosen such that the real vectors x (or x) and y (or y) defined 
in Eq. (l2Tl i of the Methods section are coplanar with a and 
b, defined in Eq. ( l20b (and which are fixed by A, B and |V')). 
Furthermore, in order to saturate (l9l4TOl for instance, x and y 
should precisely be the orthogonal projections of a and 5; this 
is indeed ensured when one uses the optimal output functions 
fopti'in) and gopt{m) of Eqs. ( IS6US7b . 



C. TIGHTNESS OF OUR ERROR-TRADE-OFF 
RELATIONS 

We now show that our general error-trade-off relation (0- 
[TOl i is tight, and that so is our relation (fT2l i when A, B and \tjj) 
are — in addition to the assumptions A^ = B^ = A^ = B^ = 
1 and {A) = (B) = 0— such that \{AB)\ = 1. 

In order to do so, we provide in each case explicit examples 
of approximate joint measurements A, B (specified by their 
common eigenbases {|m)}) saturating the bounds. These 
generalize the joint measurement strategies for qubits given 
in the main text. Note however that such optimal approxima- 
tion strategies are not unique. 



1. Tightness of our error-trade-off relation f9tiT0t 

Interestingly, one does not need to use any ancillary sys- 
tem to saturate inequality (|9]-[T0]i. As we show below, in the 
non-trivial case where A^ AB > 0, this can indeed be done 
for instance by performing on \ip) a projective measurement 
Ai with eigenvectors in Spa.n{\^) , A\ip) , B\t/j)} (and where 
the remaining eigenvectors are orthogonal to jV'), A\tl.') and 

We define for convenience the observables Aq ^ [A — 
{A)]/AA and Bo = \B ~ {B)]/AB (such that (Ao) = 
(Bo) = and (A^) = (S§) = 1). Note that from the 
Cauchy-Schwarz inequality applied to ^o|V') ^nd Boltp), one 
has |(^o-So)| < 1 — which corresponds to the Schrodinger 
uncertainty relation liSSJ, and is a stronger version than the 
better-known Robertson uncertainty relation (0). 



Let us consider first the case where K^o^o)! = 1 — in 
which the vectors Ao\tp) and Bo\tp) are linearly dependent — 
and define = arg(^o^o). 0a — — f and (f>b = -f, such 
that {AqBo) = e"^ and Cab = AAABIin{AoBo) = 
A A AB sin 4>. For an angle ip and a real parameter q y^ 0, 
we define 



|rnr) = {t+qe'^'^-'^^^Aom, 
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(1 



'(^'-^)Bo)|V). 



(S21) 
(S22) 



Note that {rrT^\rri^) = 1 - e"*"^(Aoi?o> = 0. One can 
then define unit vectors \mi) and |to2) by normalizing the 
vectors \wr[) and |to^), and complete the orthonormal basis 
{|mi),|m2)}ofSpan{|V),A|V>} = Span{|V), -B|V>} with 



some additional vectors \m) in H, all orthogonal to \ip), A\^) 
and B\ip), so as to obtain a full orthogonal basis for H; we 
then define the observable Al to be a projective measurement 
onto this basis. 

For a given outcome corresponding to an eigenvector \m), 
let us use the optimal approximations for the measurements 
of A and B, i.e., let us output the values /opf(»^^) and ^opt (m) 
given in ( IS6tiS7b . According to Eqs. (ISSJISgl l, this leads to 



^A 



E K^ 



m—7ni,7n2 



Im- 



Am_\ 



= AA' y l^l^^imL h,^S!!^ 



m\ip) J 
mj\Ao[ 



1 / \ 2 

AA^ /Tm«p-H'P-0a) 



1 + 9- 



■AA' 



2 J Im q e 

1 / ^2 



l + g-2 
AA^ sin2(v3-0j 



Img e 



-i(<p-4>a) 



(S23) 



eg can be calculated in a similar way. One thus finds 

eA -= AA |sin((/j - 0a)| , ee = AS |sin(06 - ip)\ , (S24) 

which are independent of q. By varying ip, chosen such that 
cos0sin((^— (/)„) sin((/)f,— (^) > 0, one obtains all values of 
{ea, eg) saturating the bound (|9]-[T0]i. 

Consider now the case where KAq-Bo)! < 1, and define 



Cab 
AAAB 



Im(74o-Bo)i 0' = arcsinx. 



and 



(t^'b = ^- For an angle if e [^ 2 ' 2 1 ^'^'^ '■^^ ^^^^ parame- 
ters q, r, let us define the two complex coefficients 



a 



and the vector 



qcos{ip-4)'^)+ir sui{f-(j)'^), (S25) 
-ip)-iq sin(0^ - ip), (S26) 



r cos 



\rrn) = (l+/3Ao + aBo)|V')- 



(S27) 



Let us fix two more real parameters s,t such that st = 
{WiilrrTi) / {1 — |(Ao-Bo)P)> and define, with the operator 

D = {mT\Bo\yj)Ao - {rrn\Ao\^P)Bo, 

\m) = i{rnT\rm)t+sDM-\rfn), (S28) 
\m^) = {{m\mT)i~tD)\i')-\mT). (S29) 

One can then easily check, using in particular the facts 
that {nH\tp) ^ 1, {m-[\D\i}) = {ip\D\^) = and 
{i:\DW\ij) = (l-|(AoSo)P)((TOr|mr)-l), that the three 
vectors |mr), \7ri2) and \rrT3) are orthogonal. As before, one 
can normalize these, complete the thus obtained orthonormal 
basis {\mi) , Im-z) , Im^)} of Span{|V'), A |i/'), i?| ■)/')} with 
some additional vectors jm) in H (all orthogonal to \tlj) , A\tjj) 
and B\ijj)), and define the observable Al to be a projective 
measurement onto this basis. 
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After some similar calculations as in (IS23l l (although a bit 
more tedious^), one finds again, from Eqs. ( |S8HS9t (i.e. for 
the optimal approximations of the measurements of A and B), 



ej, = AA \siii{ip 



eB = AB\sm{^',-ip)\, (S30) 



independently of q, r, s and t. By varying ip G [— , , 2 ]' 
one obtains again all values of (e_4,ei3) saturating the 
bound (HHinil. 

Let us finally mention the case where A^ AS = (which 
implies Cab ~ 0). In such a case, one can have e^ = eg = 0, 
saturating again the (then trivial) inequality (|9]i: this is indeed 
obtained for instance by defining A ~ (A) and B = B if 
AA = 0,or:A = AandB= (B) if AS = 0. 



2. Tightness of our error-disturbance relation i fTzt 
(valid for A^ = B^ ^ A^ = B^ ^ 1 and (A) = (B) = 0), 

when|(AB>| = 1 

Let us now turn to the case where A and B are assumed to 
have the same spectrum as A and B. In such a case one can 
in general no longer choose the output functions /(m) and 
g{m) for the approximations of A and B to be the optimal 
ones, prescribed by Eqs. (|S6HS7] |. If A and B are dichotomic 
observables with eigenvalues ±1, then /(m) and (7(771) are 
also bound to take values ±1; from Eq. ( |S5t . one can see that 
the optimal choice to minimize e^ and eg is now to choose 



/(to) = sign Re 



^MMY 



A4') )' 



g{m) = sign Re 



(m 



sm 



{m\ip) 
(S31) 



Using dSSl ). this leads to 



^\ = (A2)+^p(TO)/(TO)2-2^p(TO)/(TO)Re^ 



= 2-2^p(to) 
and similarly. 



Re 



(to|A|V') 
(to|i/)) 



4 = 2-2^p(to) 



Re- 



(m\B\ 



i\Am 



(S32) 



(S33) 



When |(j4i?)| = 1, let us use the eigenvectors defined 
from ( |S2lilS22t . with q = ±1. We now find, from ( |S32l - 
IS33T l. 



e^ = J2-2 cos(v?-(/)a) , eg = a/2-2 cos(06-(^) (S34) 



(note that if cos{(p~(t)a) > and cos{(pb^f) > 0, one gets 
e^ = 2 1 sin '^~'^° | and eg = 2| sin '^''~'^ |). By varying 1^9, 
such that cos(/)sin((/3— c/)a) sin(0fc— (p) > 0, one obtains all 
minimal values of (e^, ejj) saturating the bound iT% . 

Furthermore, as can be seen for instance from Figure |ST] 
(inset), inequality (fT2T i also sets upper bounds on the possi- 
ble values of {eaj^b)- By using the same eigenvectors as 
above but changing the signs of /(to) and 5(771) in Eq. ( IS31I ). 
and possibly mixing such strategies, (e_4,eB) can attain all 
possible values along the contour of the region restricted by 
inequality ( fTSI ). This shows that our error-disturbance re- 
lation ^ (valid for A"^ = B^ = A"^ = B^ = t and 
(A) = (B) = 0) is tight when"* 1(^^)1 = 1; its tightness 
in the case | (AB) | < 1 is left as an open problem. 



One may also wonder if inequality (fT2T l remains tight (when 
|(^i3)| — 1) in the specific error-disturbance scenario, where 
one imposes that A and B have the particular forms A = 
W (t (^ Ma)U and B = t/'f(B«)]l)C/ (see main text). The an- 
swer is positive: one can indeed transform the previous mea- 
surement strategy so that A and B are estimated from mea- 
surements on two separate systems. Intuitively, one needs 
to copy some information on the quantum state \t/j) onto the 
ancillary system, and rotate the first system so that the mea- 
surement in the eigenbasis {|toi), I7772), • . •} considered be- 
fore becomes a measurement of B directly. Formally, one 
can define U = (C/r ® l).[/copy and Ma = J2 f {rn)\m){Tn\, 
where C/copy is a unitary such that Ucopy\'m,£^) ~ |to,to) for 
all basis vectors |?77) and C/r is a unitary such that U^BUr = 
^g(TO)|TO)(777| (where the non-yet-prescribed values g{in) 
are chosen so that the numbers of +1 and —1 values are the 
same as the numbers of +1 and —1 eigenvalues of B, for such 
a unitary [/r to exist). One then obtains the same values for 
e_4 and eg as before, with the direct measurement of lip) in the 
eigenbasis {|to)}. 



^ One can use here in particular tlie facts tliat {m2\tp) = (rrai|mi) — 
1, (rn2\rm) = ((rriT|mr> - l)[(mr|rnr> + ^^(i _ \{AoBo)\^)], 
CmlAoW = sa{l - |(AoBo>P) - (mr|Ao|V>, {rn2\Bo\ip) = 
—s/3{l — |(j4o-Bo)P) — (rrai|So|V')i ^nd similar relations with \ms) 
(and with s replaced by — t); furthermore, (rnYlrfrf) — 1 = (1 — 
X2)(52 + r.2 + 2qr^^^^MM§l), (Im(mr|Ao|^»2 = (1 - x'^Kq^ + 



^2 + 2gr ^°<f°^°> )sin2(y-</,'J - (1 



|(Aoi3o>r)(ImQ)2 and 



(Im(mr|iJol^»' = (l-x')(<7'+r2 + 2gr55^dfl^)sin2(^'^-<^)- 



(l-|(Aoi?o>P)(Im/; 



v^ 



Let us finally mention that one can prove in very similar 
ways that our error-disturbance relation (1S20] . valid under the 
assumptions that B^ = B^ ^ 1 and (B) = 0, is also tight 

when|(AoB)| = 1. 



-• Note that under the conditions A^ = B'^ = t and (A) = {B) = 0, 
|(AB)| = 1 always holds for qubits (as in the case considered in the 
example of the main text): indeed, both A\il}) and B\il}) are orthogonal 
to 1 5/1) ; as H is of dimension 2, they are therefore linearly dependent, and 

|(AB>|2 = (A2>(B2) = 1. 
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D. OZAWA'S "UNCERTAINTY RELATION" FOLLOWS 
FROM OUR ERROR-TRADE-OFF RELATION 

We show in this last part that Ozawa's "uncertainty rela- 
tion" for joint measurements, Eq. dHJ llS4ll . follows from our 
error-trade-off relation (|9). Let us indeed write: 

{eAeB + ^BcA + AAesf 
> {ABeA+AAeB^ = AB'^e\+AA^e%+2AAAB e^eu 

> AB\\+AA\l+2^AA^AB^-Cls e^eg > CJb, 

(S35) 

where the last inequality is precisely our error-trade-off rela- 
tion (|9]l. After taking the square-root of the above expressions, 
we obtain Ozawa's relation (O. 

Hence, one can see that Ozawa's relation is sub-optimal 
in two ways. Firstly, one can simply drop the first product 
term, e^eg — which, interestingly, is precisely the term in the 
Heisenberg-Arthurs-Kelly relation (jTji. Secondly, one can de- 
crease the factor AA AB on the second line of Eq. (IS35b 
down to ^J AA^ AB"^ — C|^. We note also that Ozawa's rela- 
tion can only be saturated for e^ = or eg = 0, as otherwise 
the first inequality in (IS35b is strict. 

Similarly, Ozawa's error-disturbance "uncertainty rela- 
tion" (fTTT l llS9ll also follows from our relation (|9). Recall 
that in the case where A^ = B'^ = A^ = B"^ = t and 



{A) = {B) = 0, the strictly stronger relation (ITZt holds; 
in that case Ozawa's relation (fTTT l cannot be saturated, ex- 
cept in the trivial situation where Cab — (which allows 
for eA^m = 0)- 



We note finally that Hall derived in Ref. IIS3I1 a very sim- 
ilar "joint-measurement uncertainty relation" to Ozawa's re- 
lation ([8]), where the standard deviations A^, AB of A and 
B in Ozawa's relation are replaced by the standard deviations 
AA, AB of A and B. Despite the claim in iH, Hall and 
Ozawa's relations are however not equivalent. Hall's relation 
indeed involves more quantities that depend on the particu- 
lar choice of A and B (namely, A^ and AB), and not only 
on A,B and jV'). As a consequence. Hall's inequality does 
not simply follow from our error-trade-off relation (0 — it can 
even give a stronger constraint on e^ and eg, for fixed values 
of A^ and AB. As it is the case with Ozawa's inequality, we 
nevertheless also expect a strictly stronger relation than Hall's 
to hold, when one considers all quantities A^, AS, e^ and eg 
(possibly in addition to AA and AB, which are fixed by A, B 
and \4'))- 

Note added. Hall's relatio n has very recently also been 
investigated experimentally lISlOll . together with Ozawa's 
relation (O and another relation derived by the authors 
of lISlOll . Their experiment also demonstrated a violation of 
the Heisenberg-Arthurs-Kelly relation ^. 
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